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Abstract. Our focus was to design harvestmen-specific PCR primers to target both introns and exons of the nuclear 
protein-coding gene Elongation Factor -1 alpha (EF-la). We tested this primer set on ten genera representing all primary 
lineages of Opiliones, with sets of close phylogenetic relatives (i.e., sets of several congeners) included to specifically assess 
utility at shallow phylogenetic levels. Our research also included the collection of parallel mitochondrial protein-coding 
DNA sequence datasets for the congeneric sets to compare relative rates of evolution and gene tree congruence for EF-la 
versus mitochondrial data. The harvestmen primers resulted in successful amplification for nine of ten tested genera. Exon 
sequences for these nine genera appear orthologous to previously-reported EF-la Opiliones sequences, which were 
generated using RT-PCR methods. Newly-generated exon sequences are interrupted by three separate spliceosomal introns; 
two introns are restricted to one or two genera, but a third intron is conserved in position across all surveyed genera. 
Phylogenetic analyses of EF-la nucleotide data for congeneric sets result in gene trees that are generally congruent with 
mitochondrial gene trees, with EF-la phylogenetic signal coming from both intron and exon sites, and resolving apparently 
recent divergences (e.g., as recent as one million years ago). Overall, the combination of gene orthology, conserved intron 
position, and gene tree congruence at shallow levels suggest that this gene region will prove generally useful for both 
phylogeographic and species-level phylogenetic analyses in Opiliones, complementing already-documented utility at higher 
taxonomic levels. 
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Both phylogenetics and taxonomy obviously have been 
impacted by advances in molecular biology, and most modern 
systematic analyses published today include some molecular 
component. In the arachnid order Opiliones, several molecular 
systematic studies have been published that consider relatively 
ancient phylogenetic divergences (e.g., Giribet et al. 1999, 
2002; Shultz & Regier 2001). Fewer studies have considered 
molecular phylogenetic divergence within a species, or 
between closely related species, and those published have 
relied mostly on mitochondrial DNA sequence data. While 
informative, a mitochondrial-only perspective has limitations 
(Ballard & Whitlock 2004; Rubinoff & Holland 2005), and is 
quickly becoming obsolete in a molecular systematics world 
dominated by multigenic datasets. 

A handful of nuclear genes have been used to address recent 
divergences within Opiliones. Those utilized offer a mix of 
pros and cons. Thomas & Hedin (2008) used ribosomal ITS 
sequences at the intraspecific level in Fumontana deprehendor 
Shear (Laniatores), and although well-behaved in this case, 
this gene region has been shown to be problematic in other 
arthropod taxa because of a lack of concerted evolution (e.g., 
Harris & Crandall 2000; Bower et al. 2009). Ribosomal 28S 
sequences have been used in various harvestmen taxa, but in 
our experience these data show limited variability at the 
shallowest levels (e.g., see Hedin & Thomas 2010). Shultz & 
Regier (2009) used reverse transcriptase polymerase chain 
reaction (RT-PCR) methods to generate exon nucleotide data 
for the nuclear protein-coding genes EF-la and POL II, and 
applied these data to species-level relationships in the genus 
Caddo. While these data are informative at shallow levels, 
gathering the data requires the extra cost and expertise needed 
to conduct RT-PCR. Finally, Sharma & Giribet (2009) used 


three novel nuclear genes (H3, H4, U2 snRNA) to address 
relationships in sandokanid laniatoreans. Although the focus 
was at slightly higher taxonomic levels, several congeneric 
comparisons were made. Again, despite the clear utility in the 
multigenic perspective used in this study, these gene fragments 
individually are small (330, 160, 130 basepairs (bp), respec¬ 
tively), and our examination of these data suggests minimal 
variation among congeneric species. 

Here we assess the phylogenetic utility of the nuclear 
protein-coding gene Elongation Factor-1 alpha (EF-loc) for 
resolving relatively shallow opilion phylogenetic divergences. 
In metazoans, EF-la is a core gene involved in protein 
synthesis - this fact facilitates use in molecular phylogenetics 
(e.g., the gene is expected to be found in all taxa, have 
conserved function, etc). This gene is commonly used in 
hexapod systematics (summarized in Caterino et al. 2000), and 
has also seen limited use in arachnids (Shultz & Regier 2001, 
2009; Hedin & Maddison 2001). Given our interest in 
resolving recent divergences, we expect most phylogenetic 
information to come from either silent substitutions at third 
codon exon positions (e.g., Cho et al. 1995; Reed & Sperling 
1999), or from introns (e.g., Danforth et al. 1999; Hedin & 
Maddison 2001). Here we present the results of a commonly 
employed strategy: to develop PCR primers that reside in 
relatively conserved exons, but span variable introns that are 
conserved in position (see Palumbi 1996). We have discovered 
several EF-la introns in Opiliones, some of which are 
universally conserved in position, but variable at shallow 
phylogenetic levels. The primers developed permit consistent 
and robust amplification of a fast-evolving gene region using 
standard PCR. methods, allowing potentially rapid collection 
of population-level samples (e.g., many individuals from 
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Table 1.—PCR primers. Primers marked with an asterisk were designed after sequence collection using more general primers. The relative 
position (POS) of primers (5' end) is shown based on alignment with exon sequences of Shultz & Regier (2001). Ambiguity codes are standard. 

POS 


Forward Primers 
EF1-OP1 
EF1-OP2 
EF1-OP2SCLER 
*EFl-OP2ASAB 
*EFl-OP2BSAB 
EF1-OP3 
EF1-OP3PHAL 
EF1-OP3SCLER 
EF1-OP4 
EF1-OP4SCLER 
*EFl-OP5LEIO 
EF1-OP5PHAL 
EF1-OP5LAN 
EF1-OP5ISCH 
EF1-OP6PHAL 

Reverse Primers 
*EFl-OP5LEIORC 
EF1-OPRC1PHAL 
EF1-OPRC1SCLER 
EF1-OPRC2 
*EFl-OPRC2LEIO 
EF1-OPRC2PHAL 
EF1-OPRC3PHAL 
*EFl-OPRC3LEIO 
EF1-OPRC3SCLER 
EF1-OPRC4 
*EFl-OPRC4LEIO 
EF1-OPRC5PHAL 


5' -CGT GGTATY ACC AT Y G ATATCAC -3' 28 

5' -GATTTCATCAARAACATGATYAC -3' 112 

5' -G ATTT CATC A AG A AC AT GATT AC -3' 112 

5' -GCTGTGCTTATTGTTGCTGCYGG -3' 157 

5' - GGTACTGGTGAGTTTGAAGCTGG-3' 178 

5' - TTTGARGAAATCCARAARGAAGT-3' 322 

5' - TTTG A AG AA ATCC A A A AGG A AGT-3' 322 

5' - TTTGAGGAAATCCAGAAGGAAGT-3' 322 

5' - TACATYAAGAAGATTGGTTA-3' 352 

5' - TACATCAAGAAGATCGGTTA-3' 352 

5' - GG AG AT A AC AT GTT GG AAC A A AG-3' 415 

5' - AAC AT GTT GG A AC A AAGT ACCCA-3' 421 

5' - AACATGYTGGAAGCTTCCTCC-3' 421 

5' - AACATGTTGGARGCCAGYGC-3' 421 

5' - CAT C ACC ACT G AAGTT A A AT CT G-3' 672 

5' - CTTT GTT CCA AC AT GTT AT CT CC- 3' 437 

5' - C AG ATTT A ACTT C AGT GGT GAT G -3' 692 

5' - CGGACTTGACCTCAGTGGTGATG -3' 694 

5' - GANACGTTCTTNACRTTGAA -3' 767 

5' - GAAACGTTCTTAACATTGAA -3' 767 

5' - A CG G A A ACGTT CTT A A CGTT G A A -3' 770 

5' - ATGACCTGGGCRGTGAATTCTTC -3' 854 

5' - AT A ACCT GGGC AGT A A ATT CTT C -3' 854 

5' - ATGACCTGAGCCGTGAACTCTTC -3' 854 

5' - G A ACTTGC ANGCA AT GT G AGC -3' 935 

5' - GAACTTGCAAGCAATGTGAGC -3' 935 

5' - GGTT GT CTT CCA ATTT CTT GCC -3' 992 


multiple populations from many species). We present this 
primer set as a resource to be explored, modified, and utilized 
by other opilion systematists. 

METHODS 

Primer design. —Shultz & Regier (2001) generated nearly full- 
length EF-la exon nucleotide sequences using RT-PCR methods 
for a comprehensive sample of Opiliones. These sequences were 
downloaded from GenBank, then compiled and translated in 
MacClade 4.08 (Maddison & Maddison 2003). We designed 
primers in conserved exon regions that might also amplify 
introns, using known intron positions from spiders and 
hexapods as a guide (Table 1, Figure 1). Within conserved 
regions, we manually (i.e., without software) designed 20-25 bp 
oligonucleotides in regions with approximately equal base 
composition, and GC-rich 3' nucleotides corresponding to 
strictly conserved second codon positions. Most primers were 
designed to work for all Opiliones, but in some cases we also 
designed clade-specific primers (e.g., EF1-OP3PHAL is a primer 
that best matches the phalangioid sequences from Shultz & 
Regier 2001, etc.). We designed additional internal, clade-specific 
primers after initial sequencing with our more general primers. 

Samples. —We tested primer utility on ten genera (Table 2) 
that represent all primary higher-level clades within Opiliones 
(Fig. 2). Specimens were either preserved directly in 100% 
EtOH in the field, kept cold, and later transferred to a -80° C 
freezer (or -20° C freezer in the case of Caddo), or preserved 
directly in 100% EtOH in the lab and stored in a -80° C 


freezer. All specimens are currently housed in the Arthropods 
Genomics Collection at San Diego State University. We 
extracted genomic DNA from leg tissues using the Qiagen 
DNeasy Kit; these extracts were either suspended in 200 mi¬ 
croliters (pi) of Qiagen AE buffer, or dried down using a 
Speedvac and resuspended in a smaller volume of AE buffer. 

Testing primer utility. —For some samples (Siro, Caddo , 
Ortholasma, Dendrolasma and Bishopella), we assessed primer 
utility in a preliminary manner. Here we conducted PCR 
experiments to test whether our primers resulted in amplicons 
of expected size, and if so, we directly sequenced these 
amplicons. These sequences inform us about primer success, 
intron position, and relative intron size, but do not address 
variability of the nucleotide data within a taxon (see below). 
We conducted PCR experiments using various forward/re verse 
primer combinations (Table 2). All PCR reactions included 
0.8 pi of genomic DNA with 0.08 pi Ex Taq polymerase 
(Takara Bio Inc.), and 2.5 pi each of Takara dNTP mix, 
Takara buffer (with Mg 2+ ) and primers (at 2-4 pM concen¬ 
trations). We used tissue culture water (Sigma-Aldrich Co.) to 
bring reaction volumes to 25 pi. Experiments were conducted 
on two different PCR machines, including a MJ Research 
PTC-100 (95° C, 2 min; 35X (95° C, 30 s; 53° C, 45 s; 72° C, 
45 s); 72° C, 10 min) and an Eppendorf Mastercycler Personal 
(94° C, 3 min; 35X (94° C, 45 s; 45° C, 45 s; 72° C, 1 min 
15 s); 72° C, 5 min). Amplification products were visualized 
on agarose gels, purified via polyethylene glycol (PEG) 
precipitation, and sequenced directly at the SDSU Micro- 
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EF1-OP1 ► 


EF1-OP2 ► 

EF1-OP2ASAB ► 

EF1-OP2BSAB ► 


Forward Primers ► 

EF1-OP3 ► 


EF1-OP5LEIO ► 
EF1-OP5PHAL ► 


EF1-OP4 ► 


Apis F2, beetle C1/C2 introns 

Apis FI, Dros F2 intron 
EF1-OP6PHAL ► 


A EF1-OP5LEIORC 

4 Reverse Primers 


Spider intron II 


Spider intron I 


A EF1-OPRC1PHAL 

◄ EF1-OPRC2 


A EF1-OPRC4 


◄ EF1-OPRC3PHAL ◄ EF1-OPRC5PHAL 




Siro 

▼ 



Sclerobunus/Cyptobunus 

▼ 



Bishopella 

▼ 

▼ 

1 


▼ 


I 

Sabacon 


Intron 1 

▼i 

Ortholasma 

T 



Dendrolasma 

T 



Acuclavella 

▼ 


▼ 


▼ 



Leiobunum 



Intron II 

Intron III 


Cyphophthalmi 

Laniatores 


Dyspnoi 


Eupnoi 


Figure 1.—Upper: Relative primer positions (see also Table 1). Spider intron positions from Hedin & Maddison (2001), hexapod intron 
positions from Brady & Danforth (2004). Small circles designate 100-bp intervals. Lower: Amplicon sizes and relative intron positions for opilion 
sample. Small hashes above amplicon boxes for Ortholcismci and Sabacon indicate alternative 5' primers used for some templates (see also 
Table 2). 


chemical Core Facility (http://www.sci.sdsu.edu/dnacore/ 
sdsu_dnacorc.html) on an ABI Prism 3100 capillary machine. 

Both strands were determined for most templates using 
PCR primers in sequencing reactions. For some Leiobunum 
templates, we used additional internal primers for sequencing. 
We assembled and edited sequence contigs using Sequencher 
version 4.5. For some taxa (see Results), AT-rich regions (and 
possible allelic length heterozygosity) within introns caused 
premature stops in some sequencing reads; sequences for these 
templates were thus determined using single reads from 
opposite sides of the intron. Also, for Acuclavella we were 
able to generate high-quality 780-bp sequence reads for the 
entire exon plus intron amplieon, and thus used only a single 
sequence read for several of these templates. Sites with two 
peaks of equal intensity on chromatograms were interpreted as 
representing nueleotide heterozygosity, and scored as such in 
data matrices using standard ambiguity codes. 

To assess phylogenetic utility at shallow levels more 
rigorously, we collected EF-la sequences for sets of close 
phylogenetic relatives (both within and between species) for 


Leiobunum , Sabacon , SclerobunusICyptobunus, and Acuclavella. 
For these same taxon sets we also collected parallel mitochon¬ 
drial sequence datasets, allowing us to gauge the relative 
variability of nuclear versus mitochondrial gene data, and ask 
if independent phylogenetic analyses resulted in generally 
congruent results. For Sabacon , SclerobunusICyptobunus, and 
Acuclavella, we collected partial cytochrome oxidase I (COl) 
mitochondrial sequences, following standard techniques as in 
Thomas & Hedin (2008). Forward primers included Cl-N-1510 
or C1-N-1718S, combined with Cl-J-2568 or C1-J-2776S (see 
Hedin & Thomas 2010). For Leiobunum , we collected nearly 
full-length mitochondrial ND1 sequences using the custom 
primers LR-N-12945LEI (5'- TGACCTCGATGTTGAAT- 
TAA -3') and CB-J-11638LEI (5' - CCTWATAAACTAAT- 
CATTTAGC - 3'). 

We conducted phylogenetic analyses at two different 
hierarchical levels. First, we conducted an exon-only analysis 
that included the Shultz & Regier (2001) exon data, plus 
newly-generated exon-only data (i.e., introns removed). This 
exon matrix was aligned manually, and subject to a heuristic 
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Table 2.- Taxon sample, including voucher and collection location information, mitochondrial sample, and successful EF-la primers. 


Taxon 

Voucher 

no. 

Collection location (N lat., W long.) 

mtDNA 

EF-la primers 

Sironidae 

Siro cf kamicikensis 

OP 2350 

Idaho: Idaho Co., FS 311, S Route 14 (45.6853, 
-115.5427) 

no 

OP3, OPRC4LEI 

Caddoidea: Caddidae 

Caddo agilis OP2565 

Phalangioidea: Sclerosomatidae 

New Hampshire: Cheshire Co., Pisgah State Park 
(42.862, -72.428) 

no 

failed 

Leiobunum aklrichi 

OP 829 

Mississippi: Tishomingo Co., Tishomingo State Park 
(34.6054, -88.1927) 

yes 

OP2, OPRC4LEI 

L. aldriclii 

OP 1069 

Missouri: Calhoun Co., Marshall (42.3012, 

-84.9674) 

yes 

OP2, OP5LEIORC, OPRC4LEI 

L. vittatum 

OP 835 

Tennessee: Cumberland Co., Cumberland Mtn SP 
(35.9013, -84.9958) 

yes 

OP2, OP4, OP5LE10RC, 
OPRC4LEI 

L. speciosum 

OP 1405 

Tennessee: Davidson Co., west Nashville (36.1227, 
-86.9061) 

yes 

OP2, OP4, OP5LEIORC, 
OPRC4LEI 

L. calcar 

OP 1394 

North Carolina: Clay Co., Big Tuni Creek (35.1025, 
-83.7007) 

yes 

OP2, OP4, OP5LEIORC, 
OPRC4LEI 

L. calcar 

OP 814 

Tennessee: Cocke Co., road to Cosby CG (35.7633, 
-83.2115) 

yes 

OP2, OPRC4LEI 

L. nndescribed species 

OP 1383 

Virginia: Grayson Co., Grayson Highlands SP 
(36.6247, -81.5013) 

yes 

OP2, OPRC4LE1 

L. serratipalpe OP 1080 

Troguloidea: Nemastomatidae 

Maryland: Montgomery Co., Unity, Tusculum Farm 
(39.2427, -77.0872) 

yes 

OP2, OPRC4LEI 

Ortholasma rugosnm 

OP 807 

California: San Benito Co., NE of Pinacate Peak, off 
Hwy 101 (36.8602, -121.6126) 

no 

OP2BSAB, OP3, OPRC4LEI 

Ortholasma nigosum 

OP 808 

California: San Mateo Co., San Bruno Mtn SP 
(37.6943, -122.4529) 

no 

OP3, OPRC4LEI 

Dendrolasma mirabile OP 1000 

Ischryopsalidoidea: Ceratolasmatidae 

Oregon: Curry Co., Hwy 33, E of Gold Beach 
(42.5465, -124.1273) 

no 

OP3, OPRC4LEI 

Acuclavella cf. qnattuor 

OP2233 

Idaho: Idaho Co., Eagle Mt Trailhead, S side of 
Lochsa River (46.4292, —115.1335) 

yes 

OPRC4 only 

A. merickeli 

OP2251 

Idaho: Idaho Co., along headwaters of Red River, 
Red River Rd (45.7853, -115.2026) 

yes 

OPRC4 only 

A. qnattuor 

OP2255 

Idaho: Idaho Co., Tributary of Crooked Creek, FS 
Rd 222 (45.5791, -115.4431) 

yes 

OPRC4 only 

A. cosmetoides 

OP2278 

Idaho: Idaho Co., Canyon Creek Trailhead, US 12 
(46.2101, -115.5442) 

yes 

OPRC4 only 

A. cf. merickeli 

OP2345 

Washington: Jefferson Co., Cedar Creek, E of US 
(101 47.7105, -124.4095) 

yes 

OP2BSAB, OPRC4 

A. cf. merickeli OP2347 

Ischryopsalidoidea: Sabaconidae 

Washington: Lewis Co., tributary of Iron Creek, FS 
Rd 25 (46.4033, -121.9902) 

yes 

OP2BSAB, OPRC4 

Sabacon simoni 

OP2522 

Italy: Piemont, Prov. Cuneo, near Monesi di Triora 
(44.07381, 7.75028) 

yes 

OP2BSAB, OPRC4 

S. cavicolens 

OP721 

Tennessee: Anderson Co., near Norris Dam Cave 
(36.221, -84.09) 

yes 

OP2BSAB, OPRC4 

S. cavicolens 

OP657 

Virginia: Lee Co., Cave Spring Rec. Area (36.803, 
-82.92) 

yes 

OP2BSAB, OPRC4 

S. cf. cavicolens 

OP1283 

North Carolina: Macon Co., Bullpen Bridge (35.015, 
83.126) 

yes 

OP2BSAB, OPRC4 

S. cf cavicolens 

OP691 

North Carolina, Haywood Co., below Hebo Mtn. 
(35.686, -82.906) 

yes 

OP2BSAB, OPRC4 

S. cf cavicolens 

OP1290 

North Carolina: Transylvania Co., Looking Glass 
Creek (35.297, -82.767) 

yes 

OP2BSAB, OPRC4 

S. cf cavicolens 

OP660 

Virginia: Scott Co., Hwy 23/58, near Weber City 

no 

OP2, OPRC4 


(36.6341, -82.5604) 
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Table 2.—Continued. 


Taxon 

Voucher 

no. 

Collection location (N lat., W long.) 

mtDNA 

EF-la primers 

S. cf. cavicolens OP 1535 

Travunioidea: Sclerobuninae 

North Carolina: Avery Co., Henson Creek, N of 
Ingalls (36.0374, -82.0420) 

no 

OP2, OPRC4 

Sclerobunus 

nondimovpliicus 

OP 1056 

Oregon: Clatsop County, Ecola SP (45.9221, 
-123.9767) 

yes 

OP3, OP4SCLER 

Cyptobunus cavicolens 

OP2143 

Montana: Jefferson County, Lewis and Clark 

Caverns (45.8386, -111.8668) 

yes 

OPRC3SCLER; OPRC4 

Sclerobunus robustus 
robustus 

OP885 

New Mexico: Sandoval County, Jemez Mtns 
(35.8384, -106.4044) 

yes 

OP4SCLER, OPRC3SCLER 

Sclerobunus r. robustus 

OPl 149 

Colorado: Dolores County, along Dolores River 
(37.7679, -107.9871) 

yes 

OP4SCLER, OPRC3SCLER 

Sclerobunus r. robustus 

OPI122 

Colorado: Gunnison County, S of Gothic (38.9397, 
-106.4072) 

yes 

OP3, OPRC4 

Sclerobunus r. robustus OP 1164 

Grassatores: Phalangodidae 

Colorado: Custer County, Wet Mtns (38.1335, 
-1105.1791) 

yes 

OP4SCLER, OPRC3SCLER 

Bishopella laciniosa 

OP320 

Tennessee: Johnson Co., Backbone R.ock Rec Area, S 
Damascus (36.5948, -81.8173) 

no 

OP3, OPRC4LEI 

Bishopella laciniosa 

OP108 

North Carolina: Jackson Co., Upper Falls, 
Whitewater River (35.0336, -83.0141) 

no 

OP3, OPRC4LEI 



Sironidae (Siro) 

Caddidae (Caddo) 

Sclerosomatidae (Lelobunum) 

Nemastomatidae (Ortholasma, 
Dendrolasma) 

Ceratolasmatidae ( Acuclavella) 

Sabaconidae ( Sabacon) 

Sclerobuninae (Sclerobunus, 
Cyptobunus) 

Phalangodidae ( Bishopella) 


Figure 2.—Opiliones phylogeny with taxon sample. General tree structure follows Giribet & Kury (2007, fig. 3.4). The uncertain phylogenetic 
placement of Dyspnoi is indicated by a dashed line, representing a sister clade relationship to either Eupnoi (following Shultz & Regier 2001), or 
to Laniatores (e.g., Giribet et al. 1999). 
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parsimony search in PAUP v4.06 (Swofford 2002), using 
stepwise taxon addition from 1000 random replicates with tree 
bisection-reconnection (TBR) branch swapping. For the 
congeneric sets, we conducted exhaustive parsimony searches 
on both nuclear and mitochondrial matrices, conservatively 
treating EF-la intron gaps as missing. Branch support was 
assessed using nonparametric bootstrap analyses (Felsenstein 
1985), comprising 1000 pseudoreplicates of a branch-and- 
bound parsimony search for each congeneric set matrix. 

RESULTS 

Data availability, primer utility. —All sequences have been 
deposited to GenBank (accession numbers GQ870643- 
GQ870668; GQ872152-GQ872185). Both intron and exon 
alignments are available at www.treebase.org (study accession 
number S2469). 

Using various primer combinations (Fig. 1, Table 2), we 
were able to amplify and generate EF-la sequences for all 
surveyed genera, except for the caddoid Caddo agilis. We 
attempted eight different primer combinations for a single 
extraction from Caddo , using primers that worked well for 
other taxa. We suspect that the failure of this sample reflects a 
difference in sample preservation, as noted above. Overall, the 
most successful primers included the forward primers EF1- 
OP2, EF1-OP2BSAB, and EF1-OP3SCLER, combined with 
the reverse primer EF1-OPRC4LEI. The most 5' (EF1-OP1) 
and y (EF1-OPRC5PHAL) of our designed primers failed in 
all PCR experiments. 

Features of the data. —Newly generated sequences were 
manually aligned with the exon-only nucleotide data of Shultz 
& Regier (2001) to identify the reading frame, intron insertions, 
and intron/exon boundaries. As a preliminary assessment of 
sequence orthology, we first removed all introns. The remaining 
exon data translate to expected amino acids, and with a single 
exception (see below) can be aligned without gaps. A strict 
consensus tree resulting from parsimony analysis of the exon- 
only data recovers expected phylogenetic placements for the 
newly-generated EF-la sequences (Fig. 3), suggesting orthology 
of the gene copies used in this study. Ours is the first study to 
include the genus Acuclavella in a higher-level molecular 
analysis; we recover Acuclavella as sister to Ceratolasma , 
consistent with the hypothesis of Shear (1986, fig. 1). 

Patterns of exon variability across divergent opilion taxa 
(e.g., among representatives of primary clades) have been 
addressed previously (Shultz & Regier (2001). Patterns of exon 
evolution within congeneric sets are summarized in Table 3. As 
expected, almost all exon variation occurs at third codon 
positions, with a total of 116, 14, and 7 variable sites at third, 
first, and second positions, respectively. Heterozygosity at 
exon positions is minimal, with less than 10 total sites scored 
as heterozygous for all newly-generated sequences. 

We discovered three different introns in the harvestmen 
sample, here named introns I—III. The relative position of these 
introns, intron size, and intron/exon nucleotide boundaries are 
summarized in Figures 1 and 4. Intron I appeared only in two 
Sabacon samples (those amplified with EF1-OP2 primers; most 
Sabacon were amplified using downstream OP2BSAB primers). 
We also used the EF1-OP2 for Leiobunum (Fig. 1), but these 
sequences lack this intron. Examination of EF-la genomic 
sequences for the tick Ixodes scapularis (http://iscapularis. 


vectorbase.org/, gene ISCW020299) indicates that intron I is 
shared in position between Ixodes and Sabacon. Intron I is 
relatively small in Sabacon (less than 100 bp), and includes 5' 
(GT) and 3' (AG) splice site signal sequences (see Fig. 4) 
consistent with proposed metazoan consensus signal sequences 
(Senapathy et al 1990; Mount et al. 1992). 

Intron II was found in all Leiobunum and one Ortholasma 
sample (OP 807, amplified with OP2BSAB; Ortholasma OP 808 
was amplified using downstream primers). These taxa are not 
phylogenetic relatives, and differ considerably at the exon level 
(see Fig. 3). We did not detect this intron in Acuclavella and 
some Sabacon , despite using primers that spanned this region 
(Fig. 1). This somewhat random phylogenetic pattern of intron 
presence/absence is most likely the result of differential intron 
loss (a similar argument applies for intron I), although we 
cannot rule out independent intron gains (e.g., see Roy & Penny 
2006). Intron II varies in size (~ 100-200 bp), and to infer 
canonical 5' (GT) and 3' (AG) splice site signal sequences in 
Ortholasma requires a two amino acid deletion (see Fig. 4). We 
return to this inference in the Discussion. 

Finally, intron III is found in all opilion samples that we have 
sequeneed. This intron is shared in position with spiders 
( Habronattus , see Hedin & Maddison 2001), but not with 
Ixodes. Intron III varies considerably in size among harvestmen 
taxa, ranging from 60 to > 500 basepairs in length; all include 
canonical 5' (GT) and 3' (AG) splice site signal sequences 
(Fig. 4). 

We generated high-quality intron I sequences for the taxa 
reported here, but in other Sabacon taxa a long (6-bp) poly-T 
region proved difficult to sequence through (data not 
reported). This provided our motivation to design Sabacon - 
specific primers (OP2BSAB) downstream of intron I. Intron II 
includes at least two long poly-T regions, but we had little 
difficultly in sequencing through this intron for several 
Leiobunum and a single Ortholasma. Our sample size is largest 
for intron III, where there is considerable variation in intron 
size (see above), nucleotide composition, and sequencing 
difficulty. The short (Fig. 4) Acuclavella introns were not 
difficult to sequence through, although these possess both 
poly-T and poly-A regions. Also, despite the long Siro intron 
III, and several simple-sequence regions, sequence reads were 
clean. Other taxa presented mixed success, even within 
congeneric sets, where some samples resulted in clean bi¬ 
directional reads, whereas other reactions resulted in clean 
reads that failed abruptly at poly-A/T regions. The sequenee 
contigs for these latter samples comprise single strand reads 
from opposite directions, with minimal terminal overlap. 
Regions of terminal overlap that proved ambiguous to score 
were excluded from the phylogenetic analyses reported below. 

Comparative phylogenetic utility. —The protein-coding mi¬ 
tochondrial data (COl, ND1) translate to amino acids without 
stop eodons, and can be aligned without gap insertions. We 
compared these mitochondrial data to intron plus exon EF-la 
data for the four sets of close phylogenetic relatives. Introns II 
(Leiobunum only) and III (all taxa) were aligned manually, but 
for the six-taxon SclerobunusICyptobunus set, introns Irom 
two specimens (OP 1056, OP2143) were too divergent to align 
reliably. Here we conducted phylogenetic analyses on exons- 
only for the six-taxon set, and analyses of introns-only for four 
of six taxa. 
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Figure 3.—Exon-only majority-rule consensus parsimony tree (N = 920, L = 2760). Higher level clade names are indicated, following 
taxonomy of Pinto-da-Rocha & Giribet (2007). Sequences generated in this study are shown with OP numbers; those from Shultz & Regier 
(2001) are shown with associated GenBank numbers. The Sclerobiinus sequence (AF240858) generated by Shultz & Regier (2001) was originally 
misidentified as S. robustus - this sequence actually corresponds to S. nondimorphicus. 


Results of the nine separate exhaustive parsimony searches 
(three for Sclerobunus! Cyptobimus , two for all other congeneric 
sets) are summarized in Fig. 5. These comparative analyses 
reveal several patterns of interest. First, all trees are well- 
resolved, with all but one of the searches resulting in a single 
most-parsimonious tree. Second, there is considerably more 
divergence depth in mitochondrial trees as compared to EF-la 
trees, as indicated by differences in total tree length and 
individual branch length estimates. This reflects the well- 
documented higher rates of molecular evolution in animal 
mitochondrial genomes as compared to “average” nuclear genes 
(e.g., Moriyama & Powell 1997; Lin & Danforth 2004). Third, 
phylogenetic signal in the EF-la data is coming from both intron 
and exon sites. This is most-clearly illustrated in the Sclerobunus! 
Cyptobimus taxon set where we conducted individual analyses on 
exons versus introns, but is also seen when comparing intron 
versus exon substitutions for specific “shallow” pairwise 


comparisons (see Table 3, Fig. 5). Finally, analyses of the 
independently-evolving mitochondrial versus EF-la data result 
in generally similar gene tree estimates. Visual inspection 
indicates an overall pattern of topological and branch length 
similarity across data partitions. Moreover, we used the 
qualitative framework of Wiens (1998) to assess topological 
congruence. Among all trees, thirteen taxon bipartitions are 
strongly supported (bootstrap proportion values > 70; see Hillis 
& Bull 1993) by the mitochondrial data. Of these 13 bipartitions, 
10 are also strongly supported by the EF-la data (Fig. 5). 
Importantly, there is no evidence for “strong incongruence” 
(following Wiens 1998), where conflicting taxon bipartitions are 
strongly supported by the parallel datasets. 

DISCUSSION 

A general lack of genomic resources for Arachnida has 
hampered development of the type of multigenic molecular 
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Table 3.—Patterns of EF-la sequence evolution. Variable intron 
sites were only counted at unambiguously aligned positions without 
indels. The following samples were not included in intron variation 
counts: Sabacon (OP660, OP 1535, OP2522), SclerobunusfCyptobunus 
(OP 1056, OP2143). Values for specific “shallow” pairwise compar¬ 
isons (involving samples with mitochondrial divergence values 
between 2-3.2%) are shown in parentheses (see also Fig. 5). 


Taxon 

Variable Intron sites 

Variable Exon sites 

Intron II 

Intron III 

Pos 1 

Pos 2 

Pos 3 

Sclerobun i is! Cyptobunus 

- 

50 (10) 

4 

2 

44(1) 

Acuclavella 

- 

16(1) 

2 

2 

16(0) 

Leiobunum 

19(1) 

11 (3) 

3 

2 

20(1) 

Sabacon 

- 

9(1) 

5 

1 

36(4) 


systematics perspective that is now seen in many other 
organismal groups (e.g., see Brito & Edwards 2008). In 
Opiliones, a core set of genes have been used at higher 
taxonomic levels (e.g., various mtDNA genes, 18S, 28S, 
histone 3, EF-la), but nuclear gene choices at lower levels 
are extremely limited. We need to develop additional rapidly- 
evolving, informative nuclear genes for harvestmen. These will 


allow more accurate reconstructions of population and 
species’ history, providing the necessary framework for 
addressing “shallow” systematic questions that abound in 
this diverse group (e.g., fine-scale historical biogeography, 
cryptic speciation, rapid character evolution, etc.). 

The primer set and data reported here provide a starting 
point for other researchers to explore, modify, and utilize. The 
exon matrix now available (www.treebase.org, study accession 
number S2469) includes many conserved regions for primer 
design, and because the matrix includes taxonomic coverage 
for all higher-level harvestmen groups, it is easy to modify 
primers to match a group of interest. Furthermore, we have 
discovered an intron (intron III) that is most-parsimoniously 
reconstructed as being ubiquitous in Opiliones, with obviously 
conserved exon regions flanking this conserved position 
intron. With minor experimentation, it should be easy for 
researchers with access to standard PCR tools to generate 
both intron and exon EF-la data for Opiliones. 

The most conspicuous problem that we have identified is the 
presence of short simple sequence regions within introns, 
which because of apparent length heterozygosity, sporadically 
interrupt direct sequence reads. Here we would first recom¬ 
mend designing custom primers that span only a single intron 
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P 



P 





1 



3 





4 



1 





7 



5 





1 

Intron I 


1 

Intron II 



Siro acaroides 

CAG 

- GCT 

AGCCAG 


AGTCGA 

Sclerobunus nondimorphicus 



.A. . 



GC. . . 


Scotolemon lespesi 






GC. . . 


Astrobunus grallator 

. .A 


.A. . 

.A 


GC. . 

.T 

Ceratolasma tricantha 

. .A 

— 

.A. . 

.A 



.T 

Sabacon imamurai 




.A 


GC. . 


Dendrolasma dentipalpe 



.A. . 



GC .A, 

.G 

Sabacon cf. cavicolens OP660 

?? ? 

GT(78bp)AG ••• 

.AT. 

.A 


GC. . 

.G 

Sabacon cf. cavicolens OP1535 

? ? ? 

GT(78bp)AG ••• 

.AT. 

.A 


GC. . 

.G 

Sabacon cavicolens OP721 



.AT. 

.A 


GC. . 

.G 

Acuclavella cf. quattuor OP2233 


.A. . 

.A 


GC. . 

.T 

Acuclavella cf. merickeli OP2347 


.A. . 

.A 


GC. . 

.T 

Leiobunum aldrichi OP829 




.A GT (180bp) AG GCC. 

.T 

Leiobunum serratipalpe OP1080 




.A GT (180bp)AG GCC. 

.T 

Ortholasma rugosum OP807 





GT(109bp)AG 

— 

-G 


Ortholasma rugosum OP808 
Dendrolasma mirabile OPIOOO 
Sclerobunus nondimorphicus OP1056 
Sclerobunus r. robustus OP1164 
Bishopella laciniosa OP320 
Bishopella laciniosa OP108 
Siro cf. kamiakensis OP2350 


b 

P 


Intron III 

- GTATT 

- -C 


c 


5 

9 

5 

i 

GGAG 


T. 


C.GT(74bp) AG 
C.GT(67bp) AG 
C. GT(82bp) AG 
. . GT(73bp) AG 
. . GT(73bp) AG 
. . GT(6 0bp) AG 
. . GT(61bp) AG 
. . GT(106bp)AG 
. . GT(103bp)AG 
..GT(311bp)AG 
..GT(399bp)AG 
..GT(573bp)AG 
..GT(406bp)AG 
..GT(445bp)AG 
..GT(564bp)AG 


C 


C 

C 

C 

C 


Figure 4.—Intron/Exon boundaries. Sequences generated in this study are shown with OP numbers, all those without are from Shultz & 
Regier (2001). Exon nucleotides are shaded in grey. Base positions are shown relative to alignment with exon sequenees of Shultz & Regier (2001). 
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Figure 5.—Comparative mitochondrial (left) and EF-la (right) parsimony trees. Number of parsimony inferred changes shown adjacent to 
branches. TL = length of most-parsimonious tree. Bootstrap proportion values above 90 are indicated by black thick branches; proportion 
values between 70-90 indicated by grey thick branches. All searches resulted in a single most-parsimonious tree, except for the Leiobunum EF-la 
matrix, which resulted in 3 most-parsimonious trees (we illustrate only one of these). For mitochondrial trees, pairwise comparisons with 
divergence values between 2-3.2% (Kimura 2-parameter distances; Kimura 1980) are highlighted with bold, italicized text. 


(after preliminary exploration of intron structure for the taxon 
of interest). Depending upon the severity of length heterozy¬ 
gosity, it may be necessary to clone PCR products, which is 
obviously more expensive and labor-intensive, but is still a 
widely used lab method. Other alternatives include allele- 
specific PCR, or bioinformatics approaches, such as the use of 
software programs that allow phase determination of length 
variable alleles resulting from direct sequencing (see Flot et al 
2006; Flot 2007). 


The data reported here reveal general congruence between 
nuclear versus mitochondrial gene trees (Fig. 5). Gene tree 
congruence, or lack thereof, is a reflection of several 
interacting variables, including sampling density, the depth 
and divergence history of the group of interest, and patterns of 
gene evolution (see Maddison 1997). For example, older, well¬ 
spaced speciation events are expected to result in more gene 
tree congruence than more-recent, “compressed-in-time” 
radiations. Gene flow across species boundaries in recently 
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separated species can cause incongruence among gene trees, as 
can comparisons of paralogous gene copies in multi-gene 
families. 

Given our sample of taxa and exemplars within these taxa, 
we observed minimal incongruence, but again, this is an 
empirical issue that is expected to vary from group to group. 
In this context, one relevant question is how old are the 
harvestmen taxa within congeneric sets? We consider two 
indirect lines of evidence for estimating the ages of these 
groups. First, we have highlighted on Fig. 5 pairwise sample 
comparisons that differ by 2-3 percent for mitochondrial 
protein-coding genes. Applying a “standard” arthropod 
mitochondrial molecular clock (e.g., 2.3% pairwise per million 
years; Brower 1994; Pons et al. 2006) across trees for these 
individual groups would indicate divergences ranging from 1- 
8 million years ago (MYA). These are, of course, very rough 
estimates, with many assumptions which may or may not hold 
(a standard clock applies to Opiliones, no among-lineage rate 
variation, etc). Second, there is strong evidence for ancient 
vicariance within Acuclavella , leading to the primary phylo¬ 
genetic separation of Olympic and Cascade mountain 
populations (OP2345, OP2347) from all other populations in 
the Idaho Rockies. In many other taxa from this region (e.g., 
Carstens et al. 2004; Steele et al. 2005), this west/east 
vicariance results from the Cascadian orogeny at approxi¬ 
mately 5 MYA, which suggests a divergence time window 
generally consistent with the above molecular clock estimates. 

The above divergence time estimates suggest that EF-la 
data might be used to resolve divergences that are quite recent 
in absolute time. Although not extensive, all pairwise sample 
comparisons at the “one million year horizon” also show at 
least some divergence in EF-la (see Table 3). Furthermore, we 
note that our phylogenetic treatment of insertion and deletion 
(indel) variation in introns was very conservative, as several 
indel sites appeared phylogenetically informative. More 
sophisticated treatment of this class of variation is expected 
to further increase the phylogenetic informativeness of this 
gene region for recent divergences (e.g., Simmons & Ochoter- 
ena 2000; Kawakita et al. 2003; Benavides et al. 2007). 

In terms of gene tree congruence, one variable that seems 
not to be influencing our data is the presence of multiple gene 
copies, and potential mixing of paralogous copies. The 
paralogy/orthology issue is one of the most important 
problems negatively impacting the phylogenetic utility of 
nuclear protein-coding genes (reviewed in Sanderson & 
Shaffer 2002). Multiple gene copies are in fact known for 
EF-la in other arthropods (see Danforth et al. 1999), 
including spiders (Hedin & Maddison 2001). “Deep paral¬ 
ogy,” or the inadvertent comparison of divergent gene copies 
(e.g., gene copies resulting from gene duplication before the 
divergence of Opiliones), is apparently not an issue in our 
data, as mixing such paralogs would not result in the expected 
systematic relationships that we see in the exon-only analysis 
(Fig. 3). An alternative is that harvestmen species are carrying 
multiple copies of the EF-la gene that are kept similar by 
concerted evolution (as hypothesized for salticid spiders, 
Hedin & Maddison 2001); distinguishing such closely-related 
paralogs would be more difficult in a phylogenetic analysis. 

The only hint of such “cryptic” paralogy is the Ortholasma 
OP807 sequence, which requires a 2 amino-acid exon gap to 


align intron II splice site signal sequences (see Fig. 4). Such a 
gap is never present in other exon sequences, even from close 
phylogenetic relatives (e.g., Dendrolasma). This sequence may 
represent a paralog, but the exon appears functional (no stop 
codons), and is placed as expected phylogenetically. A perhaps 
more plausible explanation is that we have misaligned the 
intron boundary, but this requires non-canonical splice sites 
for this intron. More data is needed to distinguish between 
these alternative hypotheses. In general, there is little to no 
evidence for multiple copies of EF-la in harvestmen. 
Researchers in prior studies (Shultz & Regier 2001, 2009) 
have not suggested this problem, relationships inferred from 
exons provide expected phylogenetic results, and gene trees for 
congeneric sets are largely congruent with independent data. 
However, the possibility of such paralogy should not be 
dismissed or overlooked in future studies. 

Various technologies are now available for the generation of 
comparative nuclear sequence data. These include expressed 
sequence tag (EST) libraries, next-generation sequencing 
(Mardis 2008), development of anonymous nuclear loci (e.g., 
Noonan & Yoder 2009), etc. Ultimately, these technologies 
may supercede studies that focus on a single “candidate gene,” 
for example, if we can ultimately generate whole genomic data 
for all samples of interest. However, these technologies are still 
too expensive for the average systematist, particularly in 
geographic areas where resources for systematics research are 
limited. Until then, we argue that studies of the phylogenetic 
behavior of “candidate genes,” such as the research reported 
here, still have considerable utility in the arachnological and 
broader systematics communities. 
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